Nairobi Province
Data Flows and Colonial Regimes in Africa: A Critical Analysis of the Colonial Futurities Embedded in AI Ecosystems
A, Ndaka., F, Avila-Acosta., H, Mbula-Ndaka., C, Amera., S, Chauke., E, Majiwa.
Data Flows and Colonial Regimes in Africa: A Critical Analysis of the Colonial Futurities Embedded in AI Recommendation Algorithms Angella Ndaka, University of Witwatersrand, Johannesburg, South Africa Fátima Ávila - Acosta, Berlin Graduate School of Social Sciences at Humboldt University, Berlin, Germany Harnred Mbula, Centre for Epistemic Justice, Nairobi, Kenya Christine Amera, Centre for Epistemic Justice, Nairobi Kenya Sandra Tiyani Chauke University of Pretoria, South Africa Eucabeth Majiwa Jomo Kenyatta University of Agriculture and Technology, Nairobi, Kenya Abstract In the last few years, Africa has experienced growth in a thriving ecosystem of Artificial Intelligence (AI) technologies and systems, developed and promoted by both local and global technology players. While the sociotechnical imaginaries about these syst ems promote AI as critical to achiev ing Africa's sustainable development agenda, some of them have subtly permeated society, recreating new values, cultures, practices, and histories that threaten to marginalize minority groups in the region. Africa predominantly frames AI as an imaginary solution to address complex social challenges; however, the narrative subtly ignores deeper power - related concerns, including data governance, embedded algorithmic colonialism, and the exploitation that propag ates new digital colonial sites. However, the development of current AI ethics in Africa is in its infancy and predominantly framed through lenses of Western perspective, with the social and ethical impacts of the AI innovations and application on African epistemologies and worldviews not prioritized. To ensure that people on the African continent leverage the benefits of AI, these social and ethical impacts o f AI need to be critically and explicitly considered and addressed. This chapter will therefore seek to frame the elemental and invisible problems of AI and big data in the African context by examining digital sites and infrastructure through the lens of power and interests. It will present reflections on how these sites are using AI recommendation algorithms to recreate new digital societies in the region, how they have the potential to propagate algorithmic colonialism and negative gender norms, and what this means for the regional sustainable development agenda. The chapter proposes adopting business models that embrace response - ability and consider the existence of alternative socio - material worlds of AI. These reflections will mainly come from ongoing discussions with Kenyan social media users in this author's user space talks, which take place every month. Keywords: Artificial Intelligence; algorithmic colonialism; Data; response - ability; digital sites Section 1: Introduction The growing global interest, combined with rising investments in AI skilling and infrastructure development, is a key driver of the expanding landscape of AI technologies and systems across Africa.
Europe Pledges 600 Billion for Clean Energy Projects in Africa
The EU's Global Gateway plan is challenging China's Belt and Road Initiative to influence Africa, by providing funding that will expand access to electricity. Nearly 600 million Africans--half the continent's population--are without electricity, largely because of the continent's limited distribution network, and Africans make up the vast majority of those worldwide without electricity access. But the European Union wants to change this. At the end of September, the president of the European Commission, Ursula von der Leyen, announced a €545 million ($636 million) investment package to support renewable energy and electrification in Africa. New EU-funded projects will include a high-voltage transmission line in Côte d'Ivoire, the electrification of hundreds of rural communities in Cameroon, the exploitation of wind and hydro energy in Lesotho, and the installation of mini-grids in remote areas of Madagascar.
AI-based Clinical Decision Support for Primary Care: A Real-World Study
Korom, Robert, Kiptinness, Sarah, Adan, Najib, Said, Kassim, Ithuli, Catherine, Rotich, Oliver, Kimani, Boniface, King'ori, Irene, Kamau, Stellah, Atemba, Elizabeth, Aden, Muna, Bowman, Preston, Sharman, Michael, Hicks, Rebecca Soskin, Distler, Rebecca, Heidecke, Johannes, Arora, Rahul K., Singhal, Karan
We evaluate the impact of large language model-based clinical decision support in live care. In partnership with Penda Health, a network of primary care clinics in Nairobi, Kenya, we studied AI Consult, a tool that serves as a safety net for clinicians by identifying potential documentation and clinical decision-making errors. AI Consult integrates into clinician workflows, activating only when needed and preserving clinician autonomy. We conducted a quality improvement study, comparing outcomes for 39,849 patient visits performed by clinicians with or without access to AI Consult across 15 clinics. Visits were rated by independent physicians to identify clinical errors. Clinicians with access to AI Consult made relatively fewer errors: 16% fewer diagnostic errors and 13% fewer treatment errors. In absolute terms, the introduction of AI Consult would avert diagnostic errors in 22,000 visits and treatment errors in 29,000 visits annually at Penda alone. In a survey of clinicians with AI Consult, all clinicians said that AI Consult improved the quality of care they delivered, with 75% saying the effect was "substantial". These results required a clinical workflow-aligned AI Consult implementation and active deployment to encourage clinician uptake. We hope this study demonstrates the potential for LLM-based clinical decision support tools to reduce errors in real-world settings and provides a practical framework for advancing responsible adoption.
Dukawalla: Voice Interfaces for Small Businesses in Africa
Ankrah, Elizabeth, Nyairo, Stephanie, Muchai, Mercy, Awori, Kagonya, Ochieng, Millicent, Kariuki, Mark, O'Neill, Jacki
Small and medium sized businesses often struggle with data driven decision making do to a lack of advanced analytics tools, especially in African countries where they make up a majority of the workforce. Though many tools exist they are not designed to fit into the ways of working of SMB workers who are mobile first, have limited time to learn new workflows, and for whom social and business are tightly coupled. To address this, the Dukawalla prototype was created. This intelligent assistant bridges the gap between raw business data, and actionable insights by leveraging voice interaction and the power of generative AI. Dukawalla provides an intuitive way for business owners to interact with their data, aiding in informed decision making. This paper examines Dukawalla's deployment across SMBs in Nairobi, focusing on their experiences using this voice based assistant to streamline data collection and provide business insights
Robustness and Cybersecurity in the EU Artificial Intelligence Act
Nolte, Henrik, Rateike, Miriam, Finck, Michèle
The EU Artificial Intelligence Act (AIA) establishes different legal principles for different types of AI systems. While prior work has sought to clarify some of these principles, little attention has been paid to robustness and cybersecurity. This paper aims to fill this gap. We identify legal challenges and shortcomings in provisions related to robustness and cybersecurity for high-risk AI systems (Art. 15 AIA) and general-purpose AI models (Art. 55 AIA). We show that robustness and cybersecurity demand resilience against performance disruptions. Furthermore, we assess potential challenges in implementing these provisions in light of recent advancements in the machine learning (ML) literature. Our analysis informs efforts to develop harmonized standards, guidelines by the European Commission, as well as benchmarks and measurement methodologies under Art. 15(2) AIA. With this, we seek to bridge the gap between legal terminology and ML research, fostering a better alignment between research and implementation efforts.
RideKE: Leveraging Low-Resource, User-Generated Twitter Content for Sentiment and Emotion Detection in Kenyan Code-Switched Dataset
Etori, Naome A., Gini, Maria L.
Social media has become a crucial open-access platform for individuals to express opinions and share experiences. However, leveraging low-resource language data from Twitter is challenging due to scarce, poor-quality content and the major variations in language use, such as slang and code-switching. Identifying tweets in these languages can be difficult as Twitter primarily supports high-resource languages. We analyze Kenyan code-switched data and evaluate four state-of-the-art (SOTA) transformer-based pretrained models for sentiment and emotion classification, using supervised and semi-supervised methods. We detail the methodology behind data collection and annotation, and the challenges encountered during the data curation phase. Our results show that XLM-R outperforms other models; for sentiment analysis, XLM-R supervised model achieves the highest accuracy (69.2\%) and F1 score (66.1\%), XLM-R semi-supervised (67.2\% accuracy, 64.1\% F1 score). In emotion analysis, DistilBERT supervised leads in accuracy (59.8\%) and F1 score (31\%), mBERT semi-supervised (accuracy (59\% and F1 score 26.5\%). AfriBERTa models show the lowest accuracy and F1 scores. All models tend to predict neutral sentiment, with Afri-BERT showing the highest bias and unique sensitivity to empathy emotion. https://github.com/NEtori21/Ride_hailing
Building low-resource African language corpora: A case study of Kidawida, Kalenjin and Dholuo
Mbogho, Audrey, Awuor, Quin, Kipkebut, Andrew, Wanzare, Lilian, Oloo, Vivian
Natural Language Processing is a crucial frontier in artificial intelligence, with broad applications in many areas, including public health, agriculture, education, and commerce. However, due to the lack of substantial linguistic resources, many African languages remain underrepresented in this digital transformation. This paper presents a case study on the development of linguistic corpora for three under-resourced Kenyan languages, Kidaw'ida, Kalenjin, and Dholuo, with the aim of advancing natural language processing and linguistic research in African communities. Our project, which lasted one year, employed a selective crowd-sourcing methodology to collect text and speech data from native speakers of these languages. Data collection involved (1) recording conversations and translation of the resulting text into Kiswahili, thereby creating parallel corpora, and (2) reading and recording written texts to generate speech corpora. We made these resources freely accessible via open-research platforms, namely Zenodo for the parallel text corpora and Mozilla Common Voice for the speech datasets, thus facilitating ongoing contributions and access for developers to train models and develop Natural Language Processing applications. The project demonstrates how grassroots efforts in corpus building can support the inclusion of African languages in artificial intelligence innovations. In addition to filling resource gaps, these corpora are vital in promoting linguistic diversity and empowering local communities by enabling Natural Language Processing applications tailored to their needs. As African countries like Kenya increasingly embrace digital transformation, developing indigenous language resources becomes essential for inclusive growth. We encourage continued collaboration from native speakers and developers to expand and utilize these corpora.
Extractive Structures Learned in Pretraining Enable Generalization on Finetuned Facts
Feng, Jiahai, Russell, Stuart, Steinhardt, Jacob
Pretrained language models (LMs) can generalize to implications of facts that they are finetuned on. For example, if finetuned on ``John Doe lives in Tokyo," LMs can correctly answer ``What language do the people in John Doe's city speak?'' with ``Japanese''. However, little is known about the mechanisms that enable this generalization or how they are learned during pretraining. We introduce extractive structures as a framework for describing how components in LMs (e.g., MLPs or attention heads) coordinate to enable this generalization. The structures consist of informative components that store training facts as weight changes, and upstream and downstream extractive components that query and process the stored information to produce the correct implication. We hypothesize that extractive structures are learned during pretraining when encountering implications of previously known facts. This yields two predictions: a data ordering effect where extractive structures can be learned only if facts precede their implications, and a weight grafting effect where extractive structures can be transferred to predict counterfactual implications. We empirically demonstrate these phenomena in the OLMo-7b, Llama 3-8b, Gemma 2-9b, and Qwen 2-7b models. Of independent interest, our results also indicate that fact learning can occur at both early and late layers, which lead to different forms of generalization.